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METHOD FOR PREPARING FOREIGN PROTEIN IN YEAST, RECOMBINANT DNA, TRANSFORMANT 



FIELD OF THE INVENTION 



s This invention relates to a method for preparing foreign protein in yeast using an expression recom- 
binant DNA comprising DNA encoding the serum albumin signal peptide adjacent to DNA encoding the 
foreign protein. 



BACKGROUND OF THE INVENTION 



In the production of specific proteins in a recombinant host by recombinant DNA technology, there are 
many advantages to having the host express and secrete the desired protein. That is, when a desired 

rs protein is expressed directly within the host cell, if there is any toxicity which inhibits growth or 
compromises the survival of the host ceil, this toxicity can be avoided by the secretion of the protein. Even 
when there is no toxicity, as the protein accumulates in the host cell, it may inhibit the host cell growth. 
This, too, can be avoided by secretory expression. In addition, systems which accumulate protein in the 
host cell may also denature it, rendering it insoluble. This problem also can be avoided by secretory 

20 expression. Moreover, when commercially producing protein by recombinant DNA technology in a system 
which accumulates the desired protein intra cellularly, it is necessary to destroy the cell in order to refine 
the protein, and it must be purified from the debris of the cellular destruction. This makes it difficult to 
obtain a protein of high purity. On the other hand, when producing a protein by a secretory expression 
system the protein only must be harvested from the culture broth, minimizing the problem of separating 

25 impurities derived from the recombinant host This is a great advantage. Finally, most protein undergoes 
some modification, such as the addition of a sugar moiety, the formation of a disulfide bond, activation by 
limited hydrolysis of the inert proprotein, phosphorylation of specific amino acids, or carboxylation before 
activation. Some of these functions are performed by the themselves, and several of these modifications 
take place in the process of secretion. Therefore, a system which produces protein by secretory expression, 

30 as compared to a system which accumulates protein intracellular^, may be expected to generate proteins 
having a structure and function much close to the native protein. 

Somethings are known about the properties of the signal peptide, and the characteristics of its amino 
acid sequence seem to be as follows. There are many basic amino acids near the N-terminal, and there are 
many polar amino acids near the portion which is digested by signal peptidase on the C-terminal side, while 

36 a sequence hydrophobic amino acids fill in the space between these two areas. The basic amino acids near 
the N-terminal interact with the phospholipids on the internal surface of the cell membrane, and the 
sequence of hydrophobic amino acids in the middle region playes an important role in passing the protein 
through the cell membrane. The polar amino acids at the C-terminal are believed to play some role in 
recognition during digestion by signal peptidase. These characteristics are extremely similar from pro- 

40 caryotes to higher animals, suggesting a common mechanism for protein secretion. (M.S. Briggs and LM. 
Qierasch, Adv. Protein Chem., 38, 109-180 (1986); G. von Heijne, EMBO J., 3, 2315-2318 (1984)). 

Human serum albumin is encoded on the gene as a prepro type "protein (see Japanese Patent 
Application (OPI) No. 29985/87 (the term OPI used herein means an unexamined published application.) or 
EP-A-206733; A Dugaiczyk et al. Proc. Natl. Acad. Sci. USA, 79, 71-75 (1982)). The DNA and amino acid 

45- sequence in the vicinity of the N-terminal of mature human serum albumin beginning from the signal 
peptide essential for secretion are shown in Table 1 below. 
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The singal peptide, composed of 18 amino acid is removed at the time of secretion. The propeptide, 
composed of 6 amino acids, is removed by processing, and mature human serum aibumin, composed of 

55 585 amino acids, and having an N-terminal amino acid sequence of Asp-Ala-His-Lys-Ser is obtained. 

Since yeast secrete less-extracellular proteases and are capable of adding sugar moieties to its secreta, 
yeast is excellent for the secretory expression of foreign proteins. 

Several cases of signal peptides which contributes to the secretory expression in cells other than yeast. 
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but which also function in yeast, have been reported. Examples include the secretory expression in yeast of 
human lysozyme using the chicken lysozyme signal peptide (Jigami, BIOINDUSTRY, 4. 117-123 (1987)), 
secretory expression in yeast of thaumatin using the signal peptide for plant protein thaumatin (L Edens, I. 
Bom, A.M. Ledeboer, J. Maat ( M.Y. Toonen, C. Visser and C.T. Verrips, Cell. 37. 629-633 (1984)), and 
s secretory expression in yeast of human interferon using the signal peptide for human interferon-a (RA 
Hitzeman, D.W. Leung. LJ. Perry, W.J.. Kohr,' H.L Levine and D.V. Doeddel, Science, 219, 620-625 (1983)). 

The truth is, however, that the signal peptide contributing to secretory expression in cells other than 
yeast does not always function in yeast. 

10 SUMMARY OF THE INVENTION 



Therefore, a primary object of this invention is to provide a method for expressing and secreting foreign 
T5 protein efficiently in yeast, the signal peptide gene functionable in yeast for secretory expression, the vector 
to be used in this method, and the transformant transformed by this vector. 

The above-described object of the present invention has been met in one embodiment by a method for 
preparing foreign protein comprising expressing and secreting said foreign protein by yeast transformed by 
a recombinant DNA comprising the serum albumin signal peptide gene adjacent to the gene of said foreign 
20 protein. In a second embodiment the present invention relates to a serum albumin signal peptide gene and 
derivatives thereof. In a third embodiment, the present invention relates to a recombinant DNA for 
transforming yeast comprising DNA encoding the serum albumin signal peptide adjacent to DNA encoding 
a foreign protein. In a forth embodiment, the present invention relates to a strain of yeast transformed by a 
recombinant DNA comprising DNA coding for the serum albumin signal peptide adjacent to DNA encoding 
25 a foreign protein. 

BRIEF DESCRIPTION OF THE DRAWINGS 



Fig. 1 shows the procedure for making pGAL12 from pGAL11 possessing the GAL1, 10 promoters. 

Fig. 2 shows the procedure for making pPT1, containing only the phoS terminator, from pAP5 and 
pUC9 containing the entire pho5 gene. 

Fig. 3 shows the procedure for making pPT2 from pJDB207. 
35 Rg. 4 and Fig. 5 show the restriction enzyme map of pGX401 containing the prepro human serum 

albumin gene. 

Rg. 6 shows the procedure for making pHSA2, containing the human serum albumin gene C-terminal 
side from pGX401 and pUC19. 

Rg. 7 shows the procedure for making pHSA1. containing the human serum albumin gene N-terminal 

40 side, from pGX401 and pUCl9. 

Rg. 8 shows the procedure for making pNH001, containing the signal peptide gene and the mature 
human serum albumin gene, from pHSA1, pHSA2 and the synthesized signal peptide gene. 

Rg. 9 shows the procedure for making pNH007, containing the GAL1 promoter, signal peptide gene 
and mature human serum albumin gene, form pNH001 and pGAL12. 
45 Rg. 10 shows the procedure for making pNH008 t containing the GAL1 promoter, signal peptide gene, 

mature human serum albumin gene and phoS terminator, from pNH007 and pPT2. 



DETAILED DESCRIPTION OF THE INVENTION 

50 

The recombinant DNA of this invention comprises the serum albumin signal peptide gene, the foreign 
protein gene, a promoter, a terminator, and the plasmid DNA or chromosome DNA. 

The origin of the serum albumin signal peptide gene is not specifically defined as long as it is derived 
55 from mammals. Practically, human-derived, rat-derived and bovine-derived preparations can be used. 

Examples of the amino aicd sequences of such signal peptides are known to include; 
Met Lys Trp Val Thr Phe He Ser Leu Leu Phe Leu Phe Ser Ser Ala Tyr Ser derived from humans; 
Met Lys Trp Val Thr Phe Leu Leu Leu Leu Phe He Ser Gly Ser Ala Phe Ser derived from rats; and 
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Met Lys Trp Val Thr Phe lie Ser Leu Leu Leu Leu Phe Ser Ser Ala Tyr Ser derived from cows. 

However, preferably, the human serum albumin signal peptide gene is used and the 2nd amino acid 
and the last five amino adds can be changed by Y and Xs as the following sequence. 
Met Y Trp Val Thr Phe He Ser Leu Leu Phe Leu Phe Xs X* Xa X 2 Xi 

wherein Y represents lys, Arg or His and preferably represents Lys; Xs represents Ala. Pro or Ser; X* 
represents Lys, Qly or Ser, Xa represents Ala, Val or Cys and preferably represents Val or Cys; X2 
represents Tyr, Trp or Sen and X1 represents Ser, Ala or Gly and preferably represents Aia or Gly. 
Preferable examples of amino acid sequences of the signal peptides are shown in Table 2 below. 

Table 2 



Sequence No. 


Y 


Xs 


X4 


Xg 


x 2 


X1 


Sequence 1 


Lys 


Ser 


Ser 


Val 


Tyr 


Ala 


Sequence 2 


Lys 


Ala 


Lys 


Val 


Ser 


Ala 


Sequence 3 


Lys 


Pro 


Gly 


Cys 


Trp 


Ala 


Sequence 4 


Lys 


Pro 


Gly 


Val 


Trp 


Ala 



The serum albumin signal peptide gene may possess a DNA sequence which can be expressed by the 
amino acid sequence shown above, and one example is having the following DNA sequence. 
ATGAAGTGGGTAACCTTTATTTCCCTT 
CTTTTTCTCTTTAGCTCGGCTTATTCC 

Preferable codons corresponding to each amino acid are set forth below. 



Ala: GCT or GCC. 


Cys: TGT, 


Asp: GAC, 


GIu: GAA, 


Phe: TTC, 


Gly: GGT t 


His: GAC, 


lie: ATT or ATC, 


Lys: AAG. 


Leu: TTG, 


Met ATG, 


Asm AAC, 


Pro: CCA, 


Gin: CAA, 


Arg: AGA, 


Sen TCT or TCC, 


Thn ACT or ACC, 


Val: GTT or GTC, 


Trp: TGG, 


Tyr TAC 





As the foreign protein in this invention, human serum albumin, interferon-a, -0, or -7, urokinase, growth 
hormone, insulin, Factor VIII, EPO, h-ANP f M-CSF and various lymphokines may be used. 

In the case of human serum albumin, pre type, pro type, or prepro type may be used, and in the case 
of urokinase, pro type or any other type may be used. Among foreign proteins, in particular, a mature 
human serum albumin gene is preferable. According to the present invention, in the case that the mature 
human serum albumin gene Is positioned immediately downstream to the serum albumin signal peptide 
gene, a substantial quantity of albumin car* be produced. 

Such foreign protein genes have been described in Japanese Patent Application (OPI) No. 29935/87 or 
EP-A-206733 (human serum albumin), Japanese Patent Application (OPI) No. 185189/86 or DE-A-3603958 
(interferon-a), Japanese Patent Application (OPI) No. 108397/86 or EP-A-1 90686 (interferon-^), Japanese 
Patent Application (OPI) No. 180591/85 or EP-A-1 54272 (urokinase), EP-A-1 60457 (Factor VIII), EP-A- 
148605 (EPO), WO85-4670 (h-ANP), WO86-4607 (M-CSF), and others. 

In the above publications, the inventions are described as plasmids containing foreign protein genes. 

The recombinant DNA for transforming yeast in this invention is prepared by linking the foreign protein 
gene downstream to the serum albumin signal peptide gene. 

The promoter and terminator are not specifically limited to those found in yeast. 

Acceptable promoters include PGK promoter (Nucleic Acid Res., 10(23) , 7791 (1982)). ADH promoter 
(ibid.), phoE (5) promoter (J. MOI. Biol.. 163(4) , 513 (1983)), GAL1 promoter (Mol. Cell. Biol.. 4(11) , 2467 
(1984)). GAL10 promoter (EP-A-132309) and GAP-DH promoter (J. Biol. Chem., 258, 5291 (1983)). Among 
these promoters, GAL1 promoter is particularly preferable. 

The promoter is positioned upstream to the serum albumin signal peptide gene. 

Acceptable terminators include the phoE(5) terminator (Cell. 12, 721-732 (1977)) and the GAP-DH 
terminator (J. Biol. Chem., 254, 9839-9845 (1979)). 

The terminator is positioned downstream to the foreign protein gene. 
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The promoter and terminator may be obtained in a form already incorporated into plasmids. 
The plasmid DNA must be capable of self-replication in yeast. 
Acceptable examples are pJDB207 (Amersham) and pJDB219 (Amersham). 

The recombinant plasmid of this invention is obtained either by cleaving a DNA sequence composed of 
5 the serum albumin signal peptide gene-foreign protein gene, a DNA sequence containing the promoter, and 
a DNA sequence containing the terminator from the above plasmid groups by a restriction enzyme and 
coupling (connecting) them to incorporate them into a proper plasmid. or by cleaving one DNA sequence 
and then incorporating it into another plasmid. 

Also, the recombinant chromosome of this invention is obtained by insertion of a DNA sequence 
10 comprising the serum albumin signal peptide gene-foreign protein gene, a DNA sequence containing the 
promoter, and a DNA sequence containing the terminator into the yeast chromosome. The detail methods 
have been described in Proc. Natl. Acad. Sci. USA, 78, 6354-6358 (1981) and Method Enzymol., 1(M. 228- 
245 (1983). 

The DNA sequence on the plasmid or the chromosome is arranged, from upstream to downstream, in 
75 the order of the promoter, serum albumin signal peptide gene, foreign protein gene, and terminator. 

As the marker for selecting the desired plasmid, it is also possible to incorporate an antibiotic 
(tetracycline, ampicillin, kanamycin) resistance gene, or a gene to compensate for a nutritional requirement 
of the host. The method of preparing a transformant by this recombinant plasmid or the method of 
preparing a foreign protein is as follows. 
20 The recombinant plasmid is introduced into the host cell i.e., yeast. Practically, a strain having a 
variation which is complemented by the selective marker gene carried by the plasmid to be inserted, for 
example, Saccharomyces cerevisiae AH22 (a, his4, Ieu2, can1) which is a leucine-requiring variant is 
acceptable for use. 

Transformation of the host cell (yeast) is conducted by an established method, for example, the calcium 
25 phosphate sedimentation method, protoplast-polyethylene glycol fusion method, electropolation method. 

The transformant is incubated in an established culture medium for the growth of the host cell. Practical 
examples of culture medium are YNB liquid culture medium (0.7 w/v% yeast nitrogen base (Difco Co.) and 
2 w/v% glucose), YPD liquid culture medium (1 w/v% yeast extract (Difco), 2 w/v% polypeptone (Daigo 
Eyo Sha), 2 w/v% glucose) and others. 
30 Incubation is performed for 20 to 100 hours, usually at 15 to 43* C (preferably about 30* C), while being 
aerated or stirred as required. 

After cultivation, the culture supernatant is recovered, and the foreign protein is purified by an 
established method, such as affinity chromatography or fractionation. 

By using the method of this invention, a desired foreign protein can be produced by secretory 
35 expression. Compared with the system intracellular accumulation, production of the protein possessing 
structure and function much close to the native protein may be expected. 

Additionally, in the system of intracellular accmulation, it is necessary to destroy the ceils to refine the 
protein and to purify the protein from the liquid which contains debris, but this type of purification process Is 
unnecessary when the method of this invention is used. 
40 The use of the serum albumin signal peptide in expression of the protein also allows the development 
of the new secretory expression method. This increases the potential usefulness of this invention consider- 
ably. 

This invention is described in further detail below by referring to the following Example, which, however, 
is not intended to limit this invention in any respect 
45 Many of the techniques, reactions and analytical methods used In this invention are well known in the 
art. Unless otherwise specified, ail enzymes can be obtained from commercial supply sources: for example, 
Takara Shuzo, Japan; New England Biolabs (NEB), Massachusetts, USA; Amersham, England; and 
Bethesda Research Laboratories (BRL), Maryland, USA. 

Buffer solutions for enzymatic reactions and reaction conditions conformed to the recommended 
so specifications of the manufacturers of the enzymes unless otherwise noted. 

The transformation method of Escherichia coli by plasmid, colony hybridization, electrophoresis, and 
DNA recovery method from gels were conducted in accordance with the methods mentioned in "Molecular 
Cloning", Cold Spring Harbor Laboratory (1982). Yeast was transformed by the method stated in "Method in 
Yeast Genetics", Cold Spring Harbor Laboratory (1981). 

56 

EXAMPLE 
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Cloning of yeast GAL1, 10 promoters * 



(A) Preparation of yeast chromosomal DNA library 

5 

The chromosomal DNA of the yeast Saccharomyces cerivisiae GRF18 PHO80 cir° strain (as described 
in EP-A- 0180958 was extracted and purified by the method described by R. Cryer et al., (Method 
Enzymol.. 12, 39 (1975)). 

According to M. Mohnson and R.W. Davis (MoL Cell. Biol., 4, 1440-1448 (1984)), the yeast GAL1, 10 

io promoter regions are located on the yeast chromosome, and when it is digested by the restriction enzymes 
EcoRI and Xbal, DNA segments of about 1 kb are obtained. Hence, yeast chromosomal DNA, extracted and 
purified as "described above, was digested by Eco RI and Xbal, and DNA segments of about 1 kb were 
isolated by electrophoresis. These segments were mixed with plasmid pUC19 (BRL) which was digested by 
EcoRI and Xbal, and dephosphorylated at its 5 terminal with alkaline phosphatase derived from caif 

is Intestines (CIpT These were ligated using the ligation kit (Takara Shuzo). This product was introduced into 
Escherichia coli JM109 (Takara Shuzo). The transformant was applied to a YT agar plate containing 0.004 
w/v% 3£gaT (5-bromo-4-chloro-3-indolyl-j8-galactoside) and 1 mM . IPTG (isopropyl-/3,D-th- 
iogalactopyranoside), and was incubated overnight at 37 *C. (To prepare the agar plate 8 g of polypeptone, 
5 g of yeast extract, and 5 g of sodium chloride were dissolved in water to make up 1 liter and 12 g of agar 

20 powder was added. After sterilizaiton in an autoclave, the mixture was dispensed into plastic Petri dish and 
solidified; X-gal and IPTG were added after autoclaving once the culture medium had cooled.) 

White and blue colonies appeared, and only the white cionies having the DNA inserts were used. (The 
desired transformant produced white colonies since the recombinant plasmid inserted therein had no lac Z 
gene.) One hundred colonies were inoculated onto an L-agar plate containing 40 u.g/ml ampiciilin by a 

25 sterilized toothpick. (To prepare the agar plate 0.62 g of tris base, 10 g of polypeptone, 5 g of yeast extract, 
and 5 g of sodium chloride were dissolved in water to make up 1 liter, and 12 g of agar powder was added. 
The mixture was sterilized in an autoclave, dispensed into plastic Petri dish and solidified: ampiciilin added, 
after autoclaving once the medium had cooled.) This L-agar plate was incubated overnight at 37* C. By this 
method, a library consisting of about 5,000 colonies was prepared. The formed colonies were transferred to 

so a nitrocellulose filter, dipped in a solution of 0.5 M sodium hydroxide and 1.5 M sodium chloride to denature 
the DNA, and were neutralized in a solution of 1.5 M sodium chloride and 0.5 M tris-hydrochloric acid at pH 
7.5. The E. coli debris was washed with 2 x SSC (0.3 M sodium chloride, 0.03 M sodium citrate at pH 7.0) 
and removedTand after drying the filter in air, it was subjected to vacuum drying for 2 hours at 80* C. 

35 

(B) Preparation of the probe 

Part of base sequence of the gene coding for the GAL1, 10 promoters was synthesized by the 
phosphoramidite method using a DNA synthesizer, Applied Biosystem Co. model 381 A. Its sequence is 
40 shown below. 

5 -CTCTATACTTTAACGTCAAG-3' * 

The sequence was subjected to electrophoresis using 7 M urea-20 w/v% polyacrylamide gel and 
purified. The 5 terminal of the purified DNA sequence was labeled radioactively by [-r 32p ] ATP and T4 
polynucleotide kinase. The reaction using 10 pmoles of synthetic DNA, 250 uCi of tr- 32p ] ATP, and 8 units 
45 of T4 polynucleotide kinase, resulted in a synthetic DNA probe terminally labeled with 32(1 (2 x 10 7 cpm 
(Cerenkov count)). The synthetic DNA probe was purified by NENSORB 20 (Du Pont). 



(C) Serving of GAL1, 10 promoters 

so 

Nitrocellulose filters having the DNA fixed as described in step (A) were placed in vinyl bags with each 
set containing 10 filters; and the following process carried out. Ten milliliters of prehybridization solution 
composed of 6 x SSC, 0.1 w/v% SDS, and 20 ug/mi of salmon sperm DNA cooled on ice after heating for 

5 minutes at 100* C was put in a vinyl bag which was sealed and incubated for 3 hours at 40 C. The 
55 prehybridization solution was then discarded and 10 ml of hybridization solution was added and incubated 

overnight at 40* C. The hybridization solution contained 6 x SSC, 0.1 w/v% SDS, 100 ug/ml salmon sperm 
DNA, and 7.5 x 10 s cpm/rnl ^-probe. After incubation, the filter was transferred to a beaker and washed in 

6 x SSC, 0.1 w/v% SDS for 30 minutes at 50* C, in 2 x SSC and 0.1 w/v% SDS for 30 minutes at 50* C, in 
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2 x SSC and 0.1 w/v% SDS for 30 minutes at 50* C, and finally in 0.1 x SSC and 0.1 w/v% SDS for 30 
minutes at 50* C. The washed filter was dried in air and subjected to autoradiography after applying 
spotting marks of 100-200 cpm. As a result, two positive clones were obtained. One of the clones was 
subjected to shaking culture overnight at 37* C in super tfroth containing 40 ug/ml of ampicillin. {To 

s prepare the super .broth 12 g of bactotrypton, 24 g of yeast extract, and 5 ml of glycerol were dissolved in 
water to make up 900 mt, which was sterilized by autoclave to obtain solution A. Then, 3.81 g of potassium 
dihydrogen phosphate and 12.5 g of potassium monohydrogen phosphate were dissolved in water to make 
up 100 ml, which was sterilized by autoclave to obtain solution B. These solutions A and B were mixed in a 
ratio of 9:1 by v/v.) Then, the plasmid DNA was extracted and purified by the alkaline-SDS method. 

to When part of the base sequence of this plasmid DNA (pGAUl, Fig. 1) was examined by the dideoxy 
method, the results coincided with the reported sequence by M. Johnston and R.W. Davis (Mol. Ceil, Biol., 
4, 1440-1448, (1984)). That is, it was found that pGAL11 possessed the GAL1 promoter in the direction of 
the Xbai site from the EcoRI site, and the GAL10 promoter in the opposite direction. 

15 

(D) Conversion of pGAL1 1 from the Xbal site to the Bam HI site 

When ligating the promoter sequence on pGAL1 1 with the DNA sequence coding for the signal peptide 
and human serum albumin, it is not convenient to have an intervening Xbal site because the Xbal site is 
20 present on the human serum albumin gene. Therefore, the Xbal site was converted to the BamHI site as 
follows. 

After digesting pGALl 1 by Xbal, the sticky end was repaired by E. coli-derived DNA polymerase I, 
Klenow fragment, in the presence of dGTP, dATP, dTTP, dCTP. To this DNA fragment, the Bam HI linker 
pCGGATCCG having a phosphorylated 5' terminal was added and was ligated by T4 DNA ligase. After then 
25 digesting with BamHI, ligation was again carried out with T4 DNA ligase and the resulting plasmid 
introduced into E. coH HB101 (EP-A-13828). From the resulting transformants, a clone having plasmid 
pGAL12 (as shown in Fig. 1) was obtained. By digesting pGAL12 by Eco RI and BamHI, the GAL1 and 
GAL10 promoters could be isolated as a DNA fragment of about 1 kb. 

so 

(E) Preparation of E. coli-yeast shuttle vector pPT2 possessing a yeast pho5 terminator 

The plasmid pAP5 which has encoded the Saccharomyces serevisiae pho5 gene is disclosed in 
Japanese Patent Application (OPI) No. 151183/87 or EP-A-216573. This plasmid was digested by the 

35 restriction enzymes Sau3AI and Pstl, and the DNA fragment which has encoded the pho5 terminator, about 
370 bp, was isolated~by electrophoresis (Fig. 2). The commercially available pUC9 (BRL) was then digested 
with Bam HI and Pstl, treated with alkaline phosphatase, and ligated with the 370 bp DNA fragment. The 
base sequence at the Sau3AI cleavage site of the 370 bp fragment was 
GATCC 

40 G 

and when it was ligated with the sticky end of the Bam HI, the Bam HI site was regenerated. Therefore, by 
digesting plasmid pPT1 obtained in the above ligation reaction with Bam HI and Pstl, or by digestion with 
Bam HI and Hindlil, a DNA fragment possessing a 370 bp pho5 terminator was obtained (Fig. 2). The 
commercially available shuttle vector pJDB207 (Fig. 3) is self-replicating in E. coli and yeast. After digestion 

45 with Bam HI and Hindlil, it was treated with alkaline phosphatase. After digesting pPT1 with Bam HI and 
Hindlil, the DNA fragment having the 370 bp pho5 terminator was isolated by electrophoresis and was 
ligated with pJDB207. From the resulting transformants, a clone having plasmid pPT2 (as shown in Fig. 3) 
Was obtained. pPT2 is an E. coli-yeast shuttle vector possessing a pho5 terminator. In E. coH, it possesses 
an ampicillin resistance markerwith 0-lactamase activity and in yeast it has a marker to compensate for a 

so leucine nutritional requirement. 



(F) Human serum albumin gene 

55 The DNA sequence coding for human serum albumin was derived from the plasmid pGX401 (Figs. 4 
and 5) disclosed in Japanese Patent Application (OPI) No. 29985/87 or EP-A-206733 as follows. pGX401 
was digested with the restriction enzymes Xbal and Hindlil, and the DNA fragment (HSA2) of about 750 bp 
coding for the C-terminal side 357 Leu to ^^^Leu of the amino acid sequence of human serum albumin, 

9 
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including the 3' untranslated region, was isolated by electrophoresis. The commercially available plasmid 
pUC19 was digested with Xbal and Hindlll, was treated with alkaline phosphatase to dephosphorylated the 
5' terminal and was ligated with HSA2 with T4 DNA ligase. It was introduced into E.'coli HB101, and from 
the resulting transfomnants, a clone having plasmid pHSA2 (as shown in Fig. 6) was obtained. 

5 Upon digesting pGX401 with Dral and Xbal, a DNA fragment of about 1 kb was isolated by 
electrophoresis. This DNA fragment is the DNA sequence encoding for the N-terminal side 12 Lys to 35e Thr 
of the amino acid sequence of human serum albumin. 

Using the DNA synthesizer Applied Biosystem model 381 A, the following DNA sequence encoding for 
the N-terminal 1 Asp to M Phe of the amino acid sequence of mature human serum albumin was synthesized 

70 by the phosphoramidite method. 
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The codon for aspartic acid (Asp) in pGX401 was GAT, but GAC was used here. As a result after 
20 ligating the synthetic DNA with the 1 kb DNA fragment derived from pGX401 , the Sa[i site was regenerated 
when it was inserted into the SaN-Xbal site of pUC19. Furthermore, when digested with HinCII, the DNA 
sequence coding for the amino acid sequence starting from the N-terminal 'Asp of mature human serum 
albumin was obtained. 

The 5' terminal of the synthetic DNA was phosphorylated by ATP and T4 polynucleotide kinase. 
25 pGX401 was digested with Dral and Xbal and 1 kb DNA fragment was isolated by electrophoresis. This 
fragment and the phosphorylated synthetic DNA were ligated with T4 ligase, digested with Sa[l and Xbal , 
and then ligated with pUC19 which was digested with Sail and Xba l and dephosphorylated by CIP. The 
resulting DNA was introduced into E. coii HB101, and from the transfomnants, a clone having the plasmid 
pHSA1 (as shown in Fig. 7) was obtained. 

30 

(G) Preparation of plasmid DNA for expressing and secreting human serum albumin in yeast 

The DNA sequence shown in Table 3 below coding for the signal peptide of human serum albumin was 
35 synthesized by the phosphoramidite method by the DNA synthesizer Applied Biosystem model 381 A. 
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Also, the DNA sequence encoding the signal peptide amino acid which was changed to Arg or His, Ala 
or Pro, Lys or Gly, Vai or Cys, Trp or Ser, Aia or Gly in the place of -17, -5. -4, -3, -2 and -1 ( respectively, 
was synthesized by the same method (cf. Table 2). The changed DNA sequence lead to produce and 
55 secrete the more proper N-terminal side of albumin. 

The 5' terminal of the synthetic DNA was phosphorylated with ATP and T4 polynucleotide kinase. 
pHSA1 was digested with Xbal and Hindi, and the 1kb HSA1 DNA fragment encoding for the N-terminal 
side of human serum albumin was isolated by electrohphoresis. The phosphorylated synthetic DNA and 



11 



EP 0 319 641 A1 



HSA1 were mixed and ligated with T4 DNA ligase, and digested further with Xbat and BamHI. After 
digesting pHSA2 with Xbal and BamH I, it was treated with alkaline phosphatase. After mixing, these DNAs 
were ligated with T4 DNA ligase and introduced into E. cofi HB101 cells. Among the resulting transformants 
a clone having the plasmid pNH001 (as shown in Rg.7*) was obtained. 

5 After digesting pNH001 with EcoRI and Bam HI, it was treated with alkaline phosphatase. Then, pGAL12 
was digested with EcoRI and Bam HI, a DNA fragment of 1 kb possessing the GAL1 promoter was isolated 
by electrophoresisTmixed with the treated pNH001 and ligated with T4 DNA ligase. From the resulting 
transformants, a clone having the plasmid pNHOOT (as shown in Fig. 9) was obtained. pNH007 is a plasmid 
DNA having the DNA sequence encoding for the human serum albumin signal peptide located downstream 

io from the GAL1 promoter, the DNA sequence encoding for mature human serum albumin immediately after 
it. and immediately following that the 3' untranslated region derived from human serum albumin cDNA 
inserted in the EcoRI-Hindlll site of pUC19. 

After digesting pNHOOT with Eco RI and Hindlll, a DNA fragment of 2.7 kb coding for the GAL1 
promoter, the signal peptide, mature human serum albumin and the untranslated region was isolated by 

is electrophoresis. Additionally, pPT2 was digested with Bam HI and treated with alkaline phosphatase. It was 
mixed with the 2.7 kb DNA fragment, and the sticky end was repaired by DNA poymerase I, Klenow 
fragment, in the presence of dATP, dGTP, dTTP. and dCTP. After ligation with T4 DNA ligase, it was 
introduced into E. coll HB101. From the resulting transformants, a clone having the plasmide pNH008 (as 
shown in Fig. lOjwas obtained. 

20 pNH008 is a plasmid capable of self-replication in E. coli and yeast and possesses the DNA sequence 
encoding for the human serum albumin signal peptide and the succeeding mature human serum slbumin 
protein under the control of the GAL1 promoter functionable in yeast Furthermore, pNH008 also possesses 
a gene for ampiciilin resistance in E. coli. and a gene for fulfilling the nutritional requirement for leucine in 
yeast, and these genes can be used as selective marker for transformants. 

25 

(H) Introduction of plasmid pNH008 Into Yeast 

Plasmid pNH008, for the secretory expression of human serum albumin, was introduced into yeast 
30 Saccharomyces cerevisiae AH22 (Proc. Nati. Acad. Sci. USA, 75, 1929-1933 (1978)) by the following 
method. 

S. cerevisiae AH22 was subjected to shaking culture overnight at 30* C in 50 ml of YPD medium. (To 
prepare the medium, 10 g of yeast extract and 20 g of bactopeptone were dissolved in water to make up 
900 ml, which was sterilized in an autoclave and mixed with 100 ml of 20 w/v% glucose separately 

35 sterilized in an autoclave). The cells were precipitated by centrifugation, resuspended in 20 ml of water, 
and centrifuged again. Next, the cells were suspended in 10 ml of 50 mM dithiothreitol, 1.2 M sorbitol, 2 
mM EDTA at pH 8.5, and were shaken slowly for 10 minutes at 30* C. The cells were collected by 
centrifugation, and suspended in 10 ml of 1.2 M sorbitol, then centrifuged again for collection. The cells 
were suspended in 10 ml of 0.2 mg/ml zymolyase 100T, 1.2 M sorbitol, 10 mM EDTA. 0.1 M sodium 

40 citrate at pH 5.8, and were shaken slowly for 1 hour at 30* C. The cells were collected by centrifugation and 
washed in 10 ml each of 1.2 M sorbitol, 10 mM calcium chloride and 1.2 M sorbitol, sequentially, and 
again the cells were collected by centrifugation. The cells were suspended in 1 ml of 10 mM calcium 
chloride and 1.2 M sorbitol. One hundred microliter aliquotes of suspension were placed in a sterile test 
tube and mixed with 5 w.1 (5 lag) of pNH008; the mixture was allowd to stand for 15 minutes at room 

45 temperature. After this, it was mixed with 12 ml of 20 w/v% polyethylene glycol 4,000. 10 mM calcium 
chloride. 10 mM tris-hydrochloride at pH 7.5, and after gentle mixing, the mixture was let stand at room 
temperature for 20 minutes. The cells were collected by centrifugation, suspended in 0.1 ml of YPD 
medium containing 1.2 M sorbitol and 10 mM calcium chloride, and shaken gently for 30 minutes at 30 *C. 
1, 5, 10, 20 and 50 ill of suspension were suspended in 45* C-controlled 10 ml of 1.2 M sorbitol, 3 w/v% 

so noble agar, 2 w/v% glucose, and 0.7 w/v% yeast nitrogen base and were spread over plates composed of 
1.2 M sobitol, 3 w/v% bactoagar, 2 w/v% glucose, and 0.7 w/v% yeast nitrogen base. After the plates 
solidified, they were subjected to stationary culture for 3 days at 30* C. Formed colonies were collected by 
a sterile toothpick suspended in 3 ml of 0.7 w/v% yeast nitrogen base and 2 w/v% glucose, and subjected 
to shaking culture for 2 days at 30* C. One and a half milliliters of suspension was centrifuged, and the cells 

55 were collected and suspended in 3 ml of YPG medium. (To prepare the culture, 10 g of yeast extract and 
20 g of bactopeptone were dissolved in water to* make up 900 ml, sterilized in an autoclave, and mixed 
with 100 ml of 20 w/v% galactose, sterilized separately in an autoclave.) This was subjected to shaking 
culture at 30* C. The human serum albumin concentration in the culture supernatant was measured by the 
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RPHA method (as described in European Patent 122,620), and a maximum human serum albumin of' 10 
. ug/ml was detected on the first day. 

s (I) Cultivation of yeast for the expression and secretion of human serum albumin 

The yeast S. cerevisiae AH22 for the expression and secretion human serum albumin tranformed by 
pNH008 as mentioned above was cultivated by the following procedure. The recombinant yeast was grown 
in a plate containing 0.7 w/v% yeast nitrogen base, 2 w/v% glucose and 3 w/v% bactoagar and collected by 

70 a platinum loop. It was inocuiated into YNB medium 50 mt composed of 0.7 w/v% yeast nitrogen base and 
2 w/v% glucose and incubated for 2 days at 30* C. The whole volume was inoculated into 500 mt of YNB 
medium and incubated for 2 days at 30'C. The ceils were collected by centrifugation, and suspended in 
500 ml of YPG medium, and subjected to shaking culture at 30* C. A portion of the culture broth was 
collected after 0, 3 ( 6, 24 and 48 hours of incubation, and the culture supernatant was obtained by 

T5 centrifugation. The concentration of human serum albumin secreted into the culture broth was measured by 
the RPHA method. Secretory expression of human serum albumin was detected beginning the third hour 
after the start of incubation, and the concentration of human serum albumin in the supernatant was 0.25 
mg/t at 6 hours, 20 mg/l at 24 hours, and 160 mg/I at 48 hours. 

20 

Claims 

1 . A method for preparing foreign protein comprising expressing and secreting said foreign protein by 
yeast transformed by a recominant DNA comprising the serum albumin signal peptide gene adjacent to the 

25 gene of said foreign protein, a promoter upstream to said serum albumin signal peptide gene and a 
terminator downstream to said foreign protein gene. 

2. The method as set forth in Claim 1, wherein the serum albumin signal peptide is human derived. 

3. The method as set forth in Claim 1, wherein the serum albumin signal peptide gene is expressed in 
the following amino acid sequence. 

30 Met Y Trp Val Thr Phe lie Ser Leu Leu Phe Leu Phe 
Xs X* X3 Xz X1 

wherein Y represents Lys, Arg, or His; Xs represents Ala, Pro or Ser; X* represents Lys, Gly or Ser; X3 
represents Ala, Val or Cys; X 2 represents Tyr, Trp or Sen and X1 represents Ser, Ala or Gly. 

4. A serum albumin signal peptide gene encoding the following amino acid sequence. 
35 Met Y Trp Val Thr Phe lie Ser Leu Leu Phe Leu Phe 

Xs X* X3 X2 Xi 

wherein Y represents Lys, Arg or His; Xs represents Ala, Pro or Ser; X* represents Lys, Gly or Ser; X3 
represents Ala, Val or Cys; X2 represents Tyr, Trp or Ser; and X1 represents Ser, Ala or Gly. 

5. A recombinant DNA for transforming yeast comprising DNA encoding the serum albumin signal 
40 peptide adjacent to DNA encoding a foreign protein, a promoter upstream to the serum albumin signal 

peptide gene and a terminator downstream to the foreign protein gene. 

6. A strain of yeast transformed by a recominant DNA comprising DNA encoding the serum albumin 
signal peptide adjacent to DNA encoding a foreign protein, a promoter upstream to the serum albumin 
signal peptide gene and a terminator downstream to the foreign protein gene. 

45 7. The method as set forth in Claim 1 , wherein the amino acid sequence of said serum albumin signal 

peptide is selected from the group consisting of: 

MetLysTrpValThrPhelleSerLeuLeuPheLeuPheSerSerAJaTyrSer, 

MetLysTrpValThrPheLeuLeuLeuLeuPhelleSerGlySerAIaPheSer. 

MetLysTrpValThrPhelleSerLeuLeuLeuLeuPheSerSerAlaTyrSer, 
50 MetLysTrpValThrPhelleSerLeuLeuPheLeuPheSerSerValTyrAla, 

MetLysTrpValThrPhelleSerLeuLeuPheLeuPheAlaLysValSerAla, 

MetLysTrpValThrPhelleSerLeuLeuPheLeuPheProGlyCysTrpAla, 

and 

MetLysTrpValThrPhelleSerLeuLeuPheLeuPheProGlyValTrpAla. 
55 8. The method as set forth in Claim 1, wherein said serum albumin signal peptide gene has the 
following DNA sequence: 
ATGAAGTGGGTAACCTTTATTTCCCTT 
CTTTTTCTCTTTAG CTCG G CTTATTCC. 
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9. The method as set forth in Claim 1, wherein said foreign protein is selected from the group 
consisting of human ssrum albumin, interferon-a, interferon-0, interferon-7, urokinase, growth hormone, 
insulin, lymphokines, h-ANP, Factor VIII, CSFs and EPO. 

10. The method as set forth in Claim 1, the foreign protein gene is the mature'human serum albumin 
s gene which is positioned immediately downstream to the serum albumin signal peptide gene. 

Claims for the following Contracting State: ES 

to 1. A method for preparing foreign protein comprising expressing and secreting said foreign protein by 
yeast transformed by a recombinant DNA comprising the serum albumin signal peptide gene adjacent to 
the gene of said foreign protein, a promoter upstream to said serum albumin signal peptide gene and a 
terminator downstream to said foreign protein gene. 

2. The method as set forth in Claim 1. wherein the serum albumin signal peptide is human derived. 

75 3. The method as set forth in Claim 1, wherein the serum albumin signal peptide gene is expressed In 
the following amino acid sequence. 
Met Y Trp Val Thr Phe lie Ser Leu Leu Phe Leu Phe 
Xs X* Xa X2 Xi 

wherein Y represents Lys, Arg or His; Xs represents Ala, Pro or Sen X4 represents Lys, Gly or Ser, X3 
20 represents Ala, Val or Cys; X2 represents Tyr, Trp or Ser; and Xi represents Ser, Ala or Gly. 

4. The method as set forth in Claim 1, wherein the amino acid sequence of said serum albumin signal 
peptide is selected from the group consisting of: 
MetLysTrpVaJThrPheileSerLeuLeuPheLeuPheSerSerAIaTyrSer, 
MetLysTrpVaiThrPheLeuLeuLeuLeuPhelleSerGlySerAlaPheSer, 

25 MetLysTrpValThrPhelleSerLeuLeuLeuLeuPheSerSerAlaTyrSer, 
MetLysTrpVaiThrPhelleSerLeuLeuPheSerSerValTyrAla, 
MetLysTrpValThrPhelieSerLeuLeuPheLeuPheAlaLysValSerAla, 
MetLysTrpValThrPhelleSerLeuLeuPheLeuPheProGlyCysTrpAla, 
and 

30 MetLysTrpValThrPhelleSerLeuLeuPheLeuPheProGlyValTrpAla. 

5. The method as set forth -in Claim 1, wherein said serum albumin signal peptide gene has the 
following DNA sequence: 

ATGMGTGGGTMCCTTTATTTCCCTT 
CTTTTTCTCTTTAGCTCGGCTTATTCC. 
35 6. The method as set forth in Claim 1, wherein said foreign protein is selected from the group 
consisting of human serum albumin, interferon-a, interferon-^, interferon-7, urokinase, growth hormone, 
insulin, lymphokines, h-ANP, Factor VIII, CSFs and EPO. 

7. The method as set forth in Claim 1, the foreign protein gene is the mature human serum albumin 
gene which is positioned immediately downstream to the serum albumin signal peptide gene. 
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